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Abstract 

We show that the axioms of Weak Kleene Algebra (WKA) are sound 
and complete for the theory of regular expressions modulo simulation 
equivalence, assuming their completeness for monodic trees (as conjec- 
tured by Takai and Furusawa). 

1 Introduction 

Kleene algebra ^5] completely axiomatizes (the equational theory of) regular 
expressions modulo trace equivalence. Here, we show that Weak Kleene Algebra 
(WKA) completely axiomatizes regular expressions modulo simulation equiv- 
alence, assuming that they are complete for the equational theory of monodic 
trees. Their completeness for monodic trees was claimed by Takai and Furusawa 
[S], but later errors in their proof were discovered and have not yet been fixed. 

By contrast, Horn axiomatization of regular expressions modulo hisimulation 
equivalence remains an open problem. 

2 Regular Expressions and Simulation 

Milner [6] proposed the following process interpretation of regular expressions, 
typically presented as a structured operational semantics over the relations x [ 
(meaning that x is in an "accepting" state) and x—>-y (meaning that state x can 
evolve into state y through an a-labeled transition), where a is a metavariable 
ranging over transition labels. We present the theory using calculus notation: 
x' is the set of terms y such that x-^ y. (The transition rules and proofs are 
uniform in the transition label, so proofs are given only for a.) As usual, we 
omit the product operator in writing regular expressions (i.e., we write x ■ y as 
X y) . We extend the regular expression operators to sets of terms by pointwise 
application, and cast terms to singleton sets when necessary. Finally, we extend 
a; I to a prefix operator on sets: x| 5 is S" if a; J,, and is the empty set otherwise. 
Using these conventions, Milner's semantics (as presented in [4l) is as follows: 
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(Note that x' is always finite (by structural induction on x).) A binary relation 
ii is a simulation iff 

(Va;, y : R{x, y) ^ {x I ^ y I) A {\/a : R{x', y'))) 

where R has been extended to sets by 

R{S, T) ^ (Vx e 5 : (3 y e T : R{x, y))) 

Define < as the maximal simulation relation, and simulation equivalence = by 
x = y^x<yAy<x. Note that simulation equivalence is weaker than 
bisimulation equivalence, which requires the two simulations to be inverses. 
For example, a b + a {b + c) and a (6 + c) are simulation equivalent, but not 
bisimulation equivalent. 



3 Weak Kleene Algebra 

Weak Kleene algebra (WKA) is given by the following axioms over the language 
of regular expressions (where x <y abbreviates x + y = y). 



\- {x + y) + z = X + {y + z) 


(1) 


\- X + X = X 


(2) 


\-x + y = y + x 


(3) 


\- x + = x 


(4) 


\- 1 X = X 


(5) 


\- X 1 = X 


(6) 


\-0x = 


(7) 


h (x y) z = x {y z) 


(8) 


\-{x + y)z = xz + yz 


(9) 


\-xy + xz<x {y + z) 


(10) 


\- x* = l + x x* 


(11) 


y X < x\- y*x < x 


(12) 


x {y + 1) < X \- X y* < X 


(13) 



We write \- x = y to mean that x = y is provable from the WKA axioms. It is 
easy to prove using these axioms that < is a partial order, and that the regular 
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operators are monotonic wrt. <. A number of similar process algebras and 
Kleene-like algebras (e.g. [8]) have been proposed, but lacking the weak right 
induction law (|13p. which was first proposed in [2]. The WKA axioms were 
independently proposed for probabilistic programs [T] and for monodic regular 
tree languages j)J. 



4 Soundness 

We prove that the WKA axioms are sound for simulation — 

(hx = y) ^ x = y (14) 

— as follows. Given a set of axioms, take the set of instances of these axioms 
whose hypotheses are satisfied by < , and in each conclusion replace x < y with 
xRy and replace x = y with xRy A yRx, where i? is a fresh binary relation 
symbol. Let R be the weakest relation satisfied by these conclusions; we prove 
R C < (which implies that < is closed under the axioms, and hence the axioms 
are sound). Defining 



R ^ R U< 

:< ~ the reflexive, transitive closure of R 

X ^ y <^ X <y /\y < X 

S <T ^ (Vx G : (3 y e T : x ^ y)) 

S r^T ^ S <T AT <S 



we show i? C < as follows: 



i? C < 

^ C < 

^ is a simulation 
(Vx,?/ : x^ y^{xi 
{yx,y : xRy (xj, = 
(Vx,?/ : xRy (xj, = 



»2/i)A(Va:x'^2/')) 

yi)A(Va:x' ^y')) 
yl)AiVa:x' <y')) 



{def < } 
{def simulation } 
{transitivity } 
{< a sim, < C ^} 



Thus, to show soundness of H h x < y for simulation, we assume H and 
show xi ^ yi and x' ^ y' . We do this (for all instances of) one axiom at a 
time, so we can use instances of earlier axioms (since we have already proved 
that they are sound for simulation). Since H t- xRy (by definition of i?), we 
can also use instances of the axiom being proved, but with < and = in the 
conclusion replaced with ^ and ~. Note also that while ' is <-monotonic (by 
the definition of <), we cannot assume ' is ^-monotonic. 

With this preparation, the soundness proofs (presented in section (R)) ') are 
mostly routine, except for the right-induction axiom (jl3p . whose proof we give 
here (even though it occurs logically after the others). This axiom is the most 
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interesting because its soundness depends on the finitcncss of x' . To see why, 
suppose X is constructed as an infinite sum x — (+71 : (a + 1)"), and let y be 
a. Then x (y + 1) < but it is not the case that x y* < x. [x y* has infinite 
derivation chains, while x does not.) 

The proof of (|13p is as follows: assuming a; (y + 1) < x, 

{xy*)l => {{u.v)i} 

(x y*y = {{u v)' } 

x'y*Llxly'y* = {distr } 

{x'Uxly')y* < {x' ^ x' 1 <x' {y+1) §1,^ } 

{x' {y + l)lJxiy')y* = {def of ' } 
{x {y + I)) y* < {x (l + y) < X (hyp); monotonicity} 

x' y* :< {proof below } 

x' 

To prove x' y* ^ x', define the binary relation Q on (the set) x' by 

uQv ^ (3 71 > : u (y + 1)" < w) 

Note uQv ^ u < V A u {y + 1) < v (by monotonicity), and that Q is total: 

x' (y+l) < {dcf < } 

x' iy+l)Uxi{y + l)' = {def of ' } 
ix{y + l)y < {a: (y + 1) < .X (hyp)} 

x' 

Because Q is a total, transitive relation on a finite set, for each u € x' , there is 
a V such that uQv A vQv; hence u y* :< v: 

u y* < {uQv, so u < V } 
V y* di {vQv, so V (y + l) < v; ^ } 

V 

Thus, for every u ^ x' , u y* ^ x\ so x' y* :< x'. 



5 Monodic Tree Languages 

A monodic tree language is a set of first-order terms over a first order language 
with a single variable □. For a term t and set of terms 5, t{S) is the set of 
terms obtained from t by replacing each variable instance with an element of S. 
(Thus, if t has n variable instances and S is finite, t{S) is a set of jS*!" terms.) 
Extend this to sets by T{S) = {UteTt{S)). 

An interpretation (notation: /, J) is a function from regular expression sym- 
bols to tree languages. Extend interpretations to regular expressions as follows 
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(when applied to tree languages, = denotes ordinary set equality): 



^(0) = {} 

iW = {□} 

I{xy) = Iix){Iiy)) 

I{x + y) = Iix)Ul{y) 

I{x*) = U„>o/((l+x)") 



In [9] , Takai and Furusawa prove that the WKA axioms are sound and complete 
for the equational theory of regular expressions interpreted as tree languages: 

{\-x = y)^ {VI : I{x) = I{y)) (15) 



6 Interpretation Respects Simulation 

Our last observation is that interpretation respects simulation equivalence: 

a; = y ^ I{x) = I{y) (16) 

The proof depends on the following property of interpretations (proved in section 
((B|) by induction on x): 

I{x) = {□} U {iJa.ze.'I{a){I{z))) (17) 

To prove ([TB)) . we first switch to an interpretation J where (Va : □ ^ J{<i))- Let 
/ be a fresh unary function symbol, and define J by 

J(a)- /(a) -{□}□{/(□) I □£/(«)} 

For any regular expression z, it is easy to show that I{z) can be computed from 
J{z) by repeatedly replacing /(t) with t. Thus, it suffices to prove a; = y => 
J[x) = J[y). 

We prove this by proving the more general (Vt, x,y : t £ J{x) — J{y) ^ x ^ 
y), by induction on |<|. For t = □, 

□ e J{x) ^ {(HZl) } 

V (3 a,u : It e A □ e J(a)(J(u))) ^ {□ ^ J(a) } 

{-j(y), m} 

xiA^yi => {dcf < } 

X ^y 
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For t ^ □, 

teJ{x)-J{y) {(HZl), t^a} 

t€{Ua,^(^x'J{a){J{z)))At^{Ua,zev'J{a){J{z))) ^ {U } 
{3 a,u : u e x' At e J{a){J{u)) A {Wv e y' : t ^ J{a){J{v)))) 

^ {a^J{a) } 
(3 a, u, to : \tO\ < \t\ Au e x' AtO & J{u) A (Vi> £ y' : tO Jiv))) 

{indue hyp } 

{3 a,u : u e x' A (iv e y' : u ^ v)) => {dcf < } 

x' ^y' => {dcf < } 

x^y 



7 Completeness 

The completeness of WKA for simulation 

X = y ^ h X — y (18) 

now follows: 

x = y {(fTC])} 
(V/ : I{x) = I{y)) ^ 

h x^y 
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A Soundness of WKA for Simulation 



m ■■ 

{x + {y + z))i 

{{x + y) + z)i 

{x + (y + z))' 
x' Uy' U z' 
((a; + y) + z)' 

(x + x) i 

xi 

[x + x)' 
x' 



^ {{u + v)i} 

= {{u+vy} 

= {{u + v)'} 

O {(u + v)'} 
= {(u + v)'} 



m ■■ 

{x + y)l ^ {{u + v)l} 

xiyyi ^ {{u + v)i} 

{y + x)i 

(x + y)' ^ {{u + v)'} 

x'Uy' = {(u + v)'} 
(y + x)' 



Q: 

{x + 0)i ^ {{u + v)l} 
xlWOi 4^ {-(Oi) } 
xl 
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{x + 0)' 
x' 



{{u + v)'} 
{0' } 



[ix)' = {{uvy,v, 11} 



{xl)i ^ {iuv)l,ll} 

{x 1)' ^ {{u v)\ 1'} 

x' 1 - {di } 

x' 



OjAxi ^ {-(Oi)} 
Oi 



(0 x)' 

0' xUOix' 

{} 
0' 



{(" «)' } 

{0', -(Oi)} 
{0' } 



{{xy)z)i ^ {iuv)l} 
xlAyiAzi <=> {{uv)i} 
{x {y z))i 



{{x y) z)' 

{x y)' zyj{x y)iz' 

{x' yUxly') zUxlylz' 

{x' y) zVJxiiy' ziJyiz') 

x' [y z)Uxliy' zUyiz') 

x' {y z)Uxl {y z)' 

{x (y z))' 



{{u vY } 

{{u v)', {u v)i} 

{distr } 

m } 



{{u v)'^ 
{{u v)' 
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{{x + y) z)i ^ {{uv)i, {u + v)i} 

{xlyyi)/\zl ^ {logic } 

(xlAzDV (ylAzi) ^ {iuv)l, (u + v) 1} 
(x z + y z)l 

{{x + y)zy = {{uv)' 

{x + y)' zL> {x + y) iz' = {{u + v)', {u + v)i, 

(x' Uy') zUxiz' Uylz' = {distr 

{x' zUxlz')U{y' zUyiz') = {{u v)' 

{x z) U {y z) = {{u + v) 

{x z + y z)' 



{xy + xz)l 44> {{u + v) I, {uv)l} 

(s^iAyi) V(xi Azi) ^ {logic } 

xiA{yiVzi) ^ {{u + v)i, {uv)i} 
{x {y + z))i 

{x y + x z)' = {{u*)' } 

x' yUxiy'Ux' zUxiz' = {U } 

x' y U x' z U X ly' U X I z' = {distr } 

x' yUx' zUxi{y + z)' ^ {^ } 

x' {y + z) U X l{y + z)' = {{u v)'} 
{x {y + z))' 

x*i ^ {u*l } 

true ^ {{u + v)i, 11} 

{1 + x x*)i 

X*' = {u — V lu C u} 

x*'[Jxlx*' = {u*' } 
x'x*Llxlx*' = {{u v)' } 
{XX*)' = {!' } 

l'U{xx*)' = {{u + v)' } 
{1 + x X*)' 

(fT2|) : We prove soundness of the stronger axiom 
in])' y X < x\- z {y*x) < z x 
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Assuming y x < x, 

{ziy*x))l ^ {{uv)lu*l} 

zlAxi ^ {{uv)i } 

{z {y*x))' = {{u v)' , distr } 

z' {y*x)Uzly*' xUziy*ix' ^ {u*' , u* i } 

z' iy*x)Uzl {y' y*) xUzix' < {§1 } 

z' iy*x)Uzly' {y* x)Uzi x' r< {y x < x; } 

z' X U z I y' x U z I x' < {y X < X, so y' x < x'} 

z' xU z I x' — {{u v) } 
(z a;)' 



B Interpretations as Trees 

Here, we prove (fT7|) : 

I{x)^xi{a}U{UaIaix)) 

where 

laix) - (U,e.'/(a)(/(^))) 
We prove this by induction on the structure of x. 

x^O: 

J(0) = {def I } 

{} = {-(Oi) } 

Oi{n} = {0' = {}, so /a(0) = {}} 

Oi{n}u(u„/„(0)) 

x = l: 

7(1) = {def I } 

{□} = {li } 

U{°} = {!' = {}, so /a(0) = {}} 
U {□}□(□„/„(!)) 

I{b) ' = {def } 

/(6)({n}) = {u } 

(Ua=j,/(a)({n})) = {a- fe^ &' = {□}} 

(Ua=./(a)(6')) = {U } 

(Ua=b(U,eb./(a)(/(z)))) = {a7^fe^6' = {} } 
{Ua{U,eb'I{a){I{z)))) = {def 4 } 

(Ua/a(6)) = {-(&i) } 

&i{n}U(Ua/a(6)) 
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X ~ u + V : 

I{u + v) = {def / } 

I{u) U I{v) = {indue hyp } 

iul{a}U{UaIa{u)))U{vl{a}U{UaIa{v))) = {U } 

{ul{a}Uvl{a})u{iUaIaiu))UiUaIa{v))) = {distr, {x + y) 1} 

{U + V)i {a} [J{U Ja{u))U{UaIa{v)) = {U } 

iu + v)l{n}U{Ua{Iaiu)Ula{v))) = {dcf/„ } 

{u + v)l{a}u{Ua{U,^u'uv'Ha){Iiz)))) = {{x + yY } 

(w + t;)i{n}U(Ua(U,g(„+„)'/(a)(/(z)))) = {def /„ } 

{u + v)i{n}L\{\JaIa{u + v)) 



I{uv) = {def / } 

I{u){I{v)) = {indue hyp} 

{ul{a}UiUaIa{u))){I{v)) = {distr } 

ul{a}{I{v))U{UaIaiu))iI{v)) = MS) = S } 

uil{v) U {[JaIa{u)){I{v)) = {indue hyp} 

ul{vl{n}U{UaIaiv)))U{UaIa{uW{v)) = {distr } 

ulvl{a}U{UaIa{u)){Iiv))Uul{UaIaiv)) = {{x y) I } 

{uv)i{n}u{UJa{u)){I{v))Uui{UJa{v)) = {(a) below } 
{u v)i{a}u{uj^{u v)) 



(a): 

{UaIa{u)){I{v))Uul{UaIa{v)) = {U, defJa} 

{UaiU,eu'Iia)iI{z)){I{v))))Uui{UaIa{v)) = {() assoc } 

iUaiU,eu'I{a){Iiz){I{v)mUuiiUaIa{v)) = {I{xy) } 

(U„(U^e„./(a)(/(z w)))) Uwi(U„/„0;)) = {U } 

iUa{Dze{u'v)Iia){I{z))))Uui{UaIa{v)) = {distr } 

(Ua(U,e(„,,)/(a)(/(z))))U(Uaui/a(f)) = {distr } 

(Ua(U,e(„,,)/(a)(/(z)))Uui/a(f)) = {def 4 } 

{Lia{l)ze{u'v)I{a){I{z))))Uui{U,ev'I{a){I{z))) = {distr } 

(Ua(U,6(„.,)/(a)(/(z))))U(U,e^„./(a)(/(^))) = {distr } 

{u' vLinl v' )I{a){I{z)))) = {{xy)' } 

(U„(U,e(„ ^yl{a){l{z)))) = {def 7„ } 

{\JaIa{u V)) 



11 



Let n and m range over natural numbers; defining Um = -f((l + u)"'), 

I{u*) = {I{x*) } 

(U„[/„)) = {(a) below } 

(U„({n}U(Ua/a(?i))(U™<„?7™))) = {distr } 

{a}U{UniUaIaiu))iUm<nUm)) = {(b),(a) beloW } 

{□}u(u,4(w))(u„(u™<„;7„)) = {u } 

{□}U(U,4(w))(U„C/„) = {/(x*) } 

{□}U(Ua/a («))(/(«*)) = {X*i } 

U(Ua/a(u))(/(w*)) = {def 7a } 

un{n}U(Ua(U,e„./(a)(/(z))))(/(u*)) = {distr } 

U(Ua(U,e„,(/(a)(/(z)))(/(w*)))) = {assoc } 

u*i{a}UiU,{U,eu'I{a){Iiz)iIiu*m) = {lix y) } 

U (Ua(U,g„./(a)(J(z u*)))) = {U } 

«n{n}U(U„(U,e(„,„.)/(a)(/(z)))) = {x*' } 

wn{n}U(U„(U,e„w7(a)(7(z)))) = {def 7„ } 

u*i{0}U{UaIa{u*)) 

(a) Un = {□} U iLlaIaiu)){Llm<nUm) 

Proof by induction on n. For n = 0, this follows from 7(1); for n > 0, 

U„+i = {(c) below } 

UnUiUaIa{u))iUn) = {iuduC hyp } 
{a}U{UaIa{u)){Um<nUm)U{UaIa{u)){Un) = {C/m C [/„ (c) } 
{□} U {UaIa{u))iUm<n+lUm) 

(b): For U-increasing sequence of languages T„, 

(U„>o7.(r„)) = L(U„>or„) 

(because each tree has a finite number of n's) 

(C) U„+i =UnU (Ua7„(u))(C/„) 



({□}u(«i{D}u(Ua7„(«))))(f/„) = {«{{□} c{n} } 

({□}U(U„7,(,i)))(f/„) = {U, def □() } 

UnU{UaIa{u)){Un) 
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